FGP: A Virtual Machine for Acquiring Knowledge from Cases

نویسندگان

  • Scott Fertig
  • David Gelernter
چکیده

Large case databases are numerous and packed with information. The largest of them are potentially rich sources of domain knowledge. The FGP machine is a software architecture that can make this knowledge explicit and bring it to bear on classification and prediction problems. The architecture provides much of the functionality of traditional expert systems without requiring the system builder to pre-process the database into rules, frames, or any other fixed abstraction. Implementations of the FGP machine use similarity-based reminding and the cases themselves to drive the inference engine. By having the system calculate and incorporate a measure of feature salience into its distance calculations, the FGP architecture smoothly copes with incomplete data and is particularly well-suited to weak-theory domains. We explain the model, describe a particular implementation of it, and present test-results for a classification task in three application areas. 1 Introduction The knowledge acquisition bottleneck is still one of Al's major problems. It has engendered commentary from senior AI practitioners for years, sparked the current revival of interest in machine learning, and is the primary motivation behind a multi-million dollar project to assemble a massive knowledge base (on the order of 10 8 facts) [Guha and Lenat, 1990]. In recent workshops, many researchers have come to the conclusion that the effective use of large databases is our best hope to break through this bottleneck [Friesen and Golshani, 1989]. Our work is in this vein, and is an attempt to devise a domain-independent software architecture and related algorithms to extract knowledge from a large database of unstructured cases, a term we define below. Cases are similar to feature-vector representations of data, but more general. They may be incomplete; individual cases need to have values for only an arbitrary subset of the universe of features. Cases are not assigned to fixed categories. Rather any feature in the feature-universe is a potential category; the program must be willing to direct its inference process at determining the probable value of any arbitrary feature for a particular case. We say probable because we make no assumption about consistency of cases. Two cases may have identical values along every dimension but one. We adopt this definition because these are the characteristics observed in cases drawn from real-world domains. The question we sought to address is whether the static feature information stored in cases provides enough material to do useful inferencing and learning. The approach we …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evolutionary Arbitrage For FTSE-100 Index Options and Futures

The objective in this paper is to develop and implement FGP-2 (Financial Genetic Programming) on intra daily tick data for stock index options and futures arbitrage in a manner that is suitable for online trading when windows of profitable arbitrage opportunities exist for short periods from one to ten minutes. Our benchmark for FGP-2 is the textbook rule for detecting arbitrage profits. This r...

متن کامل

Investment Decision Making Using FGP: A Case Study

Financial investment decision making is extremely difficult due to the complexity of the domain. Many factors could influence the change of share prices. FGP (Financial Genetic Programming) is a genetic programming based forecasting system, which is designed to help users evaluate impact of factors and explore their interactions in relation to future prices. Users channel into FGP factors that ...

متن کامل

Analytical evaluation of an innovative decision-making algorithm for VM live migration

In order to achieve the virtual machines live migration, the two "pre-copy" and "post-copy" strategies are presented. Each of these strategies, depending on the operating conditions of the machine, may perform better than the other. In this article, a new algorithm is presented that automatically decides how the virtual machine live migration takes place. In this approach, the virtual machine m...

متن کامل

Communication-Aware Traffic Stream Optimization for Virtual Machine Placement in Cloud Datacenters with VL2 Topology

By pervasiveness of cloud computing, a colossal amount of applications from gigantic organizations increasingly tend to rely on cloud services. These demands caused a great number of applications in form of couple of virtual machines (VMs) requests to be executed on data centers’ servers. Some of applications are as big as not possible to be processed upon a single VM. Also, there exists severa...

متن کامل

A knowledge-based NSGA-II approach for scheduling in virtual manufacturing cells

This paper considers the job scheduling problem in virtual manufacturing cells (VMCs) with the goal of minimizing two objectives namely, makespan and total travelling distance. To solve this problem two algorithms are proposed: traditional non-dominated sorting genetic algorithm (NSGA-II) and knowledge-based non-dominated sorting genetic algorithm (KBNSGA-II). The difference between these algor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1991